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Abstract 



o. 

CNj , A static binary search tree where every search starts from where the previous one ends (lazy finger) 

t ! ■ is considered. Such a search method is more powerful than that of the classic optimal static trees, where 

Oh' every search starts from the root (root finger), and less powerful than when rotations are allowed — 

•^C where finding the best rotation based tree is the topic of the dynamic optimality conjecture of Sleator 

and Tarjan. The runtime of the classic root-finger tree can be expressed in terms of the entropy of 

the distribution of the searches, but we show that this is not the case for the optimal lazy finger tree. 

A non-entropy based asymptotically-tight expression for the runtime of the optimal lazy finger trees is 

derived, and a dynamic programming-based method is presented to compute the optimal tree. 



1 Introduction 

1.1 Static trees 



A binary search tree is one of the most fundamental data structures in computer science. In response to 
a search operation, some binary trees perform changes in the data structure, while others do not. For 
example, the splay tree |18j data structure performs a sequence of searches that moves the searched item 
to the root. Other binary search tree data structures do not change at all during a search, for example, 



\Q ', red- black trees [33] and AVL trees [Tj. We will call BSTs that do not perform changes in the structure 



during searches to be static and call trees that perform changes BSTs with rotations. In this work we do 

(~} [ not consider insertions and deletions, only searches, and thus can assume without loss of generality that all 

£T) • structures under consideration are storing the integers from 1 ton and that all searches are to these items. 

We consider two variants of static BSTs: root finger and lazy finger. In the classic method, the root finger 

method, the first search proceeds from the root to the item being searched. In the second and subsequent 

searches, a root finger BST executes the searches in the same manner, always starting each search from the 

r> . root. In contrast, here we consider lazy finger BSTs to be those which start each search at the destination 

of the previous search and move to the item being searched. In general, this movement involves going up to 

the least common ancestor (LCA) of the previous and current items being searched, and then moving down 

from the LCA to the current item being searched. 

1.2 Notation and definitions 

A static tree T is a fixed binary search tree containing n elements. No rotations are allowed. The data 
structure must process a sequence of searches, by moving a single pointer in the tree. Let r(T,i,j) be the 
time to move the pointer in the tree T from node i to j. If dx(i) represents the depth of node i, with the 
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root defined as having depth zero, then 

r(T,i,j) = d T (i) - d T (LCA T (i,j)) + d T (j) - d T (LCA T (t,j)) 
= dr(i) + Ml) - 2dr(LCA T (i, j)). 

The runtime to execute a sequence X = xi,X2, ■ ■ ■ x m of searches on a tree T using the root finger method 
is 



R r (T,X) = Y j r(T,root(T),x. l ) = J2 d T&) 
and the runtime to execute the same sequence on a tree T using the lazy finger method is 

m / m \ 

Re{T,X) = ^2r{T,Xi-i,Xi)= 2^(d T (^) - dr(LCA T (x;,a;;-i))) - d T (x m ) 

i=\ \ i=\ ) 

where xq is defined to be the root of T, which is where the first search starts. 

1.3 History of optimal static trees with root finger 

For the root finger method, once the tree T is fixed, the cost of any single search in tree T depends only on 
the search and the tree, not on any of the search history. Thus, the optimal search tree for the root finger 
method is a function only of the frequency of the searches for each item. Let fx (a) denote the number of 
searches in X to a. Given fx, computing the optimal static BST with root finger has a long history. In 1971, 
Knuth gave a 0(n 2 ) dynamic programming solution that finds the optimum tree [15] ■ More interestingly is 
the discovery of a connection between the runtime of the optimal tree and the entropy of the frequencies: 

fx(a) 






fx(a) 



Melhorn |16| showed that a simple greedy heuristic proposed by Knuth |15j and shown to have a 
linear-time implementation by Frcdman |llj produced a static tree where an average search took time 
^ "^ 1 — i~77f -.s H(fx)- Furthermore, Melhorn demonstrated a lower bound of y^H(fx) for an average 
search in an optimal static tree, and showed this bound was tight for infinitely many distributions. Thus, 
by 1975, it was established that the runtime for an average search in an optimal search tree with root finger 
was 0(H(fx)), and that such a tree could easily be computed in linear time. 

1.4 Our results 

We wish to study the natural problem of what we have coined search with a lazy finger in a static tree, i.e. 
have each search start where the last one ended. We seek to characterize the optimal tree for this search 
strategy, and describe how to build it. 

The lazy finger method is asymptotically clearly no worse then the root finger method; moving up to 
the LCA and back down is better than moving to the root and back down, which is exactly double the cost 
of the root finger method. But, in general, is the lazy finger method better? For the lazy finger method, 
the cost of a single search in a static tree depends only on the current search and the previous search — this 
puts lazy finger's runtime dependence on the search sequence between that of root finger and trees with 
rotations. Thus the optimal search tree for the lazy finger method only depends on the frequency of each 
search transition; let fxifl, b) be the number of searches in X to b where the previous search was to a. Given 
these pairwise frequencies (from which the frequencies fx{a) can easily be computed), is there a nice closed 
form for the runtime of the optimal BST with lazy finger? One natural runtime to consider is the conditional 
entropy: 



to = EE 



fx(a,b) f x (a) 



m fx(a,b) 



a=l 6=1 

This is of interest as information theory gives this as an expected lower bound_J if the search sequence is 
derived from a Markov chain where n states represents searching each item. 

While a runtime related to the conditional entropy is the best achievable by any algorithm parameterized 
solely on the pairwisc frequencies, however, we will show in Lemma [5] that the conditional entropy is impos- 
sible to be asymptotically achieved for any BST, static or dynamic, within any o(logro) factor. Thus, for 
the root finger, the lower bound given by information theory is achievable, for lazy finger it is not related in 
any minimal way with the runtime of the optimal tree. In Section [7] we will present a simple static non-tree 
structure whose runtime is related to the conditional entropy. 

This still leaves us with the question: is there a simple closed form for the runtime of the optimal BST 
with lazy finger? We answer this in the affirmative by showing an equivalence between the runtime of BSTs 
with lazy finger and something known as the weighted dynamic finger runtime. In the weighted dynamic 
finger runtime, if item i is given weight Wi, then the time to execute search Xi is 
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Our main theorem is that the runtime of the best static tree with lazy finger, LF(X), is given by the weighted 
dynamic finger runtime bound with the best choice of weights: 
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To prove this, we first state the result of Seidel and Aragon [17] in Section [2] of how to construct a tree 
with the weighted dynamic finger runtime given a set of weights. Then, in Scction[3j we show how, given any 
static tree T, there exists weights such the runtime of T on a sequence using lazy finger can be lower bounded 
using the weighted dynamic finger runtime with these weights. These results are combined in Section 0] to 
give the main theorem. 

While a nice closed-form formula for the runtime of splay trees is not known, there are several different 
bounds on their runtime: working set, static finger, dynamic finger, and static optimality [7HHHH]. One 
implication of our result is that the runtime of the optimal lazy finger tree is asymptotically as good as that 
of all of the aforementioned bounds with the exception of the working set bound (see Theorem [5] for why 
the working set bound does not hold on a lazy finger static structure). 

However, while these results have served to characterize the best runtime for the optimal BST, a concrete 
method is needed to compute the best tree given the pairwise frequencies. We present a dynamic program- 
ming solution in Section [6j this solution takes time 0(n 3 ) to compute the optimal tree for lazy finger, given 
a table of size n 2 with the frequency of each pair of searches occurring adjacently. This method could be 
extended using the ideas of [TJ into one which periodically rebuilds the static structure using the observed 
frequencies so far; the result would be an online structure that for sufficiently long search sequences achieves 
a runtime that is within a constant factor of the optimal tree without needing to be initialized with the 
pairwisc frequencies. 



1 When multiplied by -r—^ , as the information theory lower bound holds for binary decisions and as observed in |16| needs to 
be adjusted to the ternary decisions that occur at each node when traversing a BST. 



1.5 Why static trees? 

Static trees are less powerful than dynamic ones in terms of the classes of search sequence distributions that 
can be executed quickly, so why are we studying them? Here we list a few reasons: 

• Rotation-based trees have horrible cache performance. However, there are methods to map the nodes 
of a static tree to memory so as to have optimal performance in the disk-access model and cache- 
oblivious models of the memory hierarchy [51I51 I121IT5] . One leading cache oblivious predecessor query 
data structure that supports insertion and deletion works by having a static tree and moves the data 
around in the fixed static tree in response to insertions and deletions and only periodically rebuilds 
the static structure [4] — in such a structure an efficient static structure is the key to obtaining good 
performance even with insertions and deletions. 

• One should use the simplest structure with the least overhead that gets the job done. By completely 
categorizing the runtime of the optimal tree with lazy finger, one can know if such a structure is 
appropriate for a particular application or whether one should instead use the more powerful dynamic 
trees, or simpler root-finger trees. 

• Concurrency becomes a real issue in dynamic trees, which requires another layer of complexity to 
resolve (see, for example [5]), while static trees trivially support concurrent operations. Moreover, if 
several search sequences from several sources are interleaved, any dependence the pervious operation 
is destroyed by the interleaving. However, it is easy to have each sequence have it own lazy finger into 
a static tree, that allows each search sequence to be executed concurrently while not losing the ability 
to take advantage of any distributional temporal cohesion in each search sequence source. 

2 Weights give a tree 

Theorem 1 (Seidel and Aragon [5]). Given a set of weights W = w\, w%, . . . w n , there is a randomized 
method to choose a tree Tyy such that the expected runtime is 



r(T w ,i,j) = lg 



Emax(z,7') 
, \ 'y, .. w k 

wm(wi,Wj) 



The method to randomly create Tw is a straightforward random tree construction using the weights: 
recursively pick the root using the normalized weights of all nodes as probabilities. Thus, by the probabilistic 
method [2], there is a deterministic tree, call it T m ; nw (X) whose runtime over the sequence X is at most the 
runtime bound of Seidel and Agraon for the sequence X on the best possible choice of weights. 

Corollary 2. There is a tree T minw (X) such that 

m I ( m y^max^i^i-i) 

y^r(T minw (X),Xi-i,Xi) = min< Y^lg 7""" z " x '" 1 . 

Proof. This follows directly from Seidel and Aragon, where T , m ; nw (A) is a tree that achieves the expected 
runtime of their randomized method for the best choice of weights. □ 

3 Trees can be represented by weights 

Lemma 3. For all trees T there is a set of weights W T — wf , iff, . . . w^ such that for all i, j 
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r(T,i,j) < lg 
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Proof. These weights are simple: give a node at depth dinTa weight of -^ . Consider a search that starts 
at node i and goes to node j. Such a path goes up from i to LC At (i,j) and down to j. A lower bound on 
£™=min(i j) w k * s the we ight of LC At (i,j) which is included in this sum and is dT(LC A T(i|j)) ■ Thus we can 
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= 2max(d T (i),d r (i)) - 2d T (LCA T (j,i)) 
> dr(t) + dr(i) - 2d r (LCA T (z,i)) 
= 2r(T,i,j) 
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Corollary 4. for any tree T 
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Proof. Follows directly from Lemma |3l summing over the access sequence X, and noting that replacing the 
weights W with minimum weights can only decrease the right hand side. □ 

4 Proof of main theorem 

Here we combine the results of the previous two sections to show that the runtime of the optimal tree with 
lazy finger, LF(T), is asymptotically the weighted dynamic finger bound for the best choice of weights. 

Theorem 5. 

Cm \ / ( m y^max(xi,Xi_i) 

Vr(T,x M ,x i )U U vig ^rf'^- 11 . fc 
j-[ J \ w \~i minKi-i.^J 

Proof. We start by defining T m [ n (X) to be the optimal tree. Let 

Tmin = argmin < VV(T, Xi-i,Xi) > . 
T Iti J 

Then, the runtime of T min is at most the runtime of any other tree, including T mmw : 

rn m 

i—1 i—1 

We now proceed with the main proof. By Corollary [2J 

(rn ^m&xixiiXi-x) *\ / m \ 
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By Equation ((TJ) 

= ^ y^ r(T min (X), Xi- 1: Xi) J 
Using Corollary 0J 
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5 Hierarchy and limitations of models 

In this section we show there is a strict hierarchy of runtimes from the root finger static BST model to 
the lazy finger static BST model to the rotation-based BST model. Let OPT(X) be the fastest any binary 
search with rotations can execute X . 

Theorem 6. For any sequence X , 

min R r (T,X) = Q (mm R e (T,X)) = Q(OPT(X)). 
Furthermore there exist classes of search sequences of any length to, X^ and X^ such that 

mm R r {T, X} n ) = uj (mm R t {T, X^) 



and 



mm R e (T, X? n ) = uj(OPT(Xi)). 



Proof. We address each of the claims of this theorem separately. 

Root finger can be simulated with lazy finger: minp Rr(T, X) = ft (miny Re(T, X)). For lazy finger, 
moving up to the LCA and back down is not more work than than moving to the root and back down, which 
is exactly double the cost of the root finger method. 

Lazy finger can be simulated with a rotation-based tree: mm? Ri(T, X) = Vt(OPT(X)). The 
normal definition of a tree allowing rotations has a finger that starts at the root at every operation and can 
move around the tree performing rotations. The work of [TD] shows how to simulate with constant-factor 
overhead any number of lazy fingers in a tree that allows rotations in the normal tree with rotations and 
one single pointer that starts at the root. This transformation can be used on a static tree with lazy finger 
to get the result. 

Some sequences can be executed quickly with lazy finger but not with root finger: There 
is a X} n such that mimr R r (T, X} n ) = uj (mhiT Ri(T, X^)). One choice of X^ is the sequential search 
sequence 1, 2, . . . n, 1, 2, . . . repeated until a search sequence of length m is created. So long a to > n, this 
takes time 0(m) to execute on any tree using lazy finger, but takes tt(m\gn) time to execute on every tree 
using root finger. 



Some sequences can be executed quickly using a BST with rotations, but not with lazy finger. 

Pick some small k, say k = lg n. Create the sequence X^ in rounds as follows: In each round pick k random 
elements from l..n, search each of them once, and then perform n random searches on these k elements. 
Continue with more rounds until at total of m searches are performed. A splay tree can perform this in time 
0(m lgfc). This is because splay trees have the working-set bound, which states that the amortized time to 
search an item is at most big-0 of the logarithm of the number of different things searched since the last time 
that item was searched. For the sequence X^, the n random searches in each round have been constructed 
to have a working set bound of 0(lgk) amortized, while the k other searches in each round have a working 

set bound of 0(lg n) amortized. Thus the total cost to execute X^ on a splay tree is O I -^-g (n lg k + k lg n) J 

which is 0(?nlglg7j) since k = lgn. 

However, for a static tree with lazy finger, X^ is basically indistinguishable from a random sequence 
and takes f2(mlgn) time. This is because the majority of the searches are random searches where the 
previous item was a random search, and in any static tree the expected distance between two random items 
isO(lgn). □ 

Lemma 7. A BST in any model cannot reach the conditional entropy. 

Proof. Wilber |20| proved that the bit reversal sequence is performed in 57 (n lg n) time in an optimal dynamic 
BST. This sequence is a precise permutation of all elements in the tree. However, any single permutation 
repeated over and over has a conditional entropy of 0, since every search is completely determined by the 
previous one. □ 

6 Constructing the optimal lazy finger BST 

Recall that f a j, = fx{&, b) is the number of searches in X where the current search is to b and the previous 
search is to a, and fxifl) is the number of searches to a in X . We will first describe one method to compute 
the cost to execute X on some tree T. Suppose the nodes in [a, b] constitute the nodes of some subtree of T, 
call it T a j, and denote the root of the subtree as r(T a jy). We now present a recursive formula for computing 
the expected cost of a single search in T. Let Re(T, X, a, b) be the number of edges traversed in T a jj when 
executing X. Thus, Re(T, X, 1, n) equals the runtime Re(T, X). There is recursive formula for Ri{T, X, a, b): 



R e (T,X,a,b) = { 
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The formula is long but straightforward. First we recursively include the number of edges traversed in the 
left (a) and right (b) subtrees of the root r(T 0] &). Thus, all that is left to account for is traversing the edges 
between the root of the subtree and its up to two children. Both edges to its children are traversed when a 
search moves from the left to right subtree of r a j, or vice- versa (c). A single edge to a child of the r(T a ^) 
traversed if a search moves from either the left or right subtrees of r(T a j,) to r(T a: i,) itself or vice- versa (d), 
or if a search moves from any node but the root in the current subtree containing the nodes [a, b] out to the 
rest of T or vice- versa (e). 



This formula can easily be adjusted into one to determine the optimal cost over all trees — since at each 
step the only dependence on the tree was is root of the current subtree, the minimum can be obtained 
by trying all possible roots. Here is the resultant recursive formulation for the minimum number of edges 
traversed in and among all subtrees containing [a, b] : 
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This formula can trivially be evaluated using dynamic programming in 0(n 5 ) time as there are 0(n 3 ) 
choices for a, b, and r and evaluating the summations in the brute-force way takes time 0(n 2 ). The dynamic 
programming gives not only the cost of the best tree, but the minimum roots chosen at each step gives the 
tree itself. The runtime can be improved to 0(n 3 ) by observing that when / is viewed as a 2-D array, each of 
the sums is simply a constant number of partial sum queries on the array /, each of which can be answered 
in 0(1) time after 0(n 2 ) preprocessing. (The folklore method of doing this is to store all the 2-D partial 
sums from the origin; a generic partial sum can be computed from these with a constant number of additions 
and subtractions). 

We summarize this result in the following theorem: 

Theorem 8. Given the pairwise frequencies fx finding the tree that minimizes the execution time of search 
sequence X using lazy finger takes time 0(n 3 ). 

This algorithm computes an optimal tree, and takes time liner in the size of the frequency table /. 
Computing / from X can be done in 0(m) time, for a total runtime of 0(m + n 3 ). It remains open if 
there is any approach to speed up the computation of the optimal tree, or an approximation thereof. Note 
that although our closed form expression of the asymptotic runtime of the best tree was stated in terms of 
an optimal choice of weights, the dynamic program presented here in no way attempts to compute these 
weights. It would be interesting if some weight-based method were to be discovered. 



7 Multiple trees structure 

Here we present a static data structure in the comparison model on a pointer machine that guarantees an 
average search time of 0(H c (fx)log d n) for any fixed value 1 < d < n, a runtime which we have shown 
to be impossible for any BST algorithm, static or dynamic. This data structure requires 0(dn) space. In 
particular, setting d — n e gives a search time of 0(H c (fx)) with space 0{n 1+£ ) for any e > 0. 

As a first attempt, a structure could be made of n binary search trees T\,T%, . . . T„ where each tree Tj 
is an optimal static tree given the previous search was to i. By using tree T Xil to execute search T,, the 
asymptotic conditional entropy can be easily obtained. However the space of this structure is 0(n 2 ). Thus 
space can be reduced by observing the nodes not near the root of every tree are being executed slowly and 
thus need not be stored in every tree. 

The multiple trees structure has two main parts. It is composed first by a complete binary search tree T" 
containing all of S. Thus the height of T" is O(lgn). The second part is n binary search trees {Ti,T2, . . . ,T n }. 
A tree T* contains the d elements j that have the greatest frequencies fx(hj)', these are the j elements most 



frequently searched after that i has been searched. The depth of an element j in T is 0(lg j ., f K ) ■ For each 
element j in the entire structure we add a pointer linking j to the root of Tj. The tree T" uses O(n) space 
and every tree Tj uses O(d) space. Thus the space used by the entire structure is 0{dn). 

Suppose we have just searched the clement i and our finger search is located on the root of Ti. Now we 
proceed to the next search to the element j in the following way: Search j in T^. If j is in Ti then we are 
done, otherwise search j in T'. After we found j cither in Tj or T" we move the finger to the root of Tj by 
following the aforementioned pointer. 

If j is in Ti then it is found in time 0(lg , i'l ). Otherwise if y is found in T", then it is found in 0(lg n) 
time. We know that if y is not in T x this means that optimally it requires O(lgd) comparisons to be found 
since T x contains the d elements that have the greatest probability to be searched after that x has been 
accessed. Hence every search is at most 0(lgn/lgd) times the optimal search time of 0(lg l x , ■ \ )■ Thus a 

search for Xi in X takes time O ( log d n lg f u x \ ) • Summing this up over all m searches Xi in X gives 
the runtime to execute X: 

= 0(mH c (fx)\og d n) 
We summarize this result in the following theorem: 

Theorem 9. Given the pairwise frequencies fx and a constant d, 1 < d < n, the multiple trees structure 
executes X in time 0(mH c (fx) log d n) and uses space 0(nd). 

We conjecture that no pointer-model structure has space 0(n) and search cost 0(H c (fx))- 
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